Search Result

Select

Reducing Multi-model Biases for Robust Visual Question Answering

ZHANG Fengshuo, LI Yu, LI Xiangqian, XU Jin’an, CHEN Yufeng

Acta Scientiarum Naturalium Universitatis Pekinensis 2024, 60 (1): 23-33. DOI: 10.13209/j.0479-8023.2023.072

Abstract （82）

HTML

PDF（pc）（5709KB）（50）

Save

In order to enhance the robustness of the visual question answering model, a bias reduction method is proposed. Based on this, the influence of language and visual information on bias effect is explored. Furthermore, two bias learning branches are constructed to capture the language bias, and the bias caused by both language and images. Then, more robust prediction results are obtained by using the bias reduction method. Finally, based on the difference in prediction probabilities between standard visual question answering and bias branches, samples are dynamically weighted, allowing the model to adjust learning levels for samples with different levels of bias. Experiments on VQA-CP v2.0 and other data sets demonstrate the effectiveness of the proposed method and alleviate the influence of bias on the model.

Related Articles | Metrics | Comments（0）

Select

Word-Based Domain Feature-Sensitive Multi-domain Neural Machine Translation

HUANG Zengcheng, MAN Zhibo, ZHANG Yujie, XU Jin’an, CHEN Yufeng

Acta Scientiarum Naturalium Universitatis Pekinensis 2023, 59 (1): 1-10. DOI: 10.13209/j.0479-8023.2022.063

Abstract （776）

HTML

PDF（pc）（802KB）（240）

Save

The accuracy of the existing word-based domain feature learning methods on domain discrimination is still low and the further research for domain feature learning is required. In order to improve domain discrimination and provide accurate translation, this paper proposes a word-based domain feature-sensitive learning mechanism, including 1) the context feature encoding at encoder side, to widen the study range of word-based domain features, introducing convolutional neural networks (CNN) in encoder for extracting features from word strings with different lengths in parallel as word context features; and 2) enhanced domain feature learning. A domain discriminator module based on multi-layer perceptions (MLP) is designed to enhance the learning ability of obtaining more accurate domain proportion from word context features and improve the accuracy of word domain discrimination. Experiments on English-Chinese task of UM-Corpus and English-French task of OPUS show that the average BLEU scores of the proposed method exceed the strong baseline by 0.82 and 1.06 respectively. The accuracy of domain discrimination is improved by 10.07% and 18.06% compared with the baseline. More studies illustrate that the improvements of average BLEU scores and accuracy of domain discrimation are contributed by the proposed word-based domain feature-sensitive learning mechanism.

Related Articles | Metrics | Comments（0）

Select

Incorporating Clause Alignment Knowledge into Chinese-English Neural Machine Translation

MIAO Guoyi, LIU Mingtong, CHEN Yufeng, XU Jin’an, ZHANG Yujie, FENG Wenhe

Acta Scientiarum Naturalium Universitatis Pekinensis 2022, 58 (1): 61-68. DOI: 10.13209/j.0479-8023.2021.111

Abstract （588）

HTML

PDF（pc）（729KB）（103）

Save

Currently, neural machine translation (NMT) is insufficient in capturing the semantic and structural relationships between clauses in complex sentences, which often results in poor discourse coherence of long and complex sentence translation. To address this problem, the paper proposes a Chinese-English NMT approach by integrating the clause alignment knowledge into NMT. Firstly, a labeling scheme combining manual and automatic annotation is introduced to annotate a large-scale clause aligned Chinese-English parallel corpus that provides rich clause-level Chinese-English bilingual alignment knowledge for model training. Then, a NMT model is designed based on clause alignment learning for enhancing the ability of the model to learn the semantic structure relationships between clauses within complex sentences. Experimental results on WMT17, WMT18 and WMT19 Chinese-English translation tasks demonstrate that proposed method can significantly improve the NMT performance. Evaluation and analysis show that proposed method can effectively improve the discourse coherence of complex sentence in Chinese-English machine translation.

Related Articles | Metrics | Comments（0）

Select

Multi-modality Paraphrase Generation Model Integrating Image Information

MA Chao, WAN Zhang, ZHANG Yujie, XU Jin’an, CHEN Yufeng

Acta Scientiarum Naturalium Universitatis Pekinensis 2022, 58 (1): 45-53. DOI: 10.13209/j.0479-8023.2021.110

Abstract （634）

HTML

PDF（pc）（15716KB）（68）

Save

In multi-modality scenarios such as commodity descriptions and news comments, existing paraphrase generation models can not utilize information from image and therefore result in the loss of semantics in the generated paraphrases. In order to solve this problem, this paper first propose the Multi-modality Paraphrase Generation (MPG) model to integrate image information for paraphrase generation. In MPG, in order to integrate the image information corresponding to the original sentence, the authors first construct an abstract scene graph and transform the image features into node features of the scene graph. Furthermore, the constructed scene graph was utilized to generate paraphrase, by using the relational graph convolutional neural network for encoder and graph-based attention mechanism for decoder. In the evaluation stage, a sentence pair similarity calculation method was proposed to select sentence pairs describing same objects from the MSCOCO data set, and then evaluation experiments were conducted. Experimental results show that the proposed MPG model achieve better semantic fidelity, which indicates that the integration of image information is effective in improving the quality of the paraphrase generation in multi-modality scenarios.

Related Articles | Metrics | Comments（0）

Select

A Review of Entity Linking Research Based on Deep Learning

LI Tianran, LIU Mingtong, ZHANG Yujie, XU Jin’an, CHEN Yufeng

Acta Scientiarum Naturalium Universitatis Pekinensis 2021, 57 (1): 91-98. DOI: 10.13209/j.0479-8023.2020.077

Abstract （1443）

HTML

PDF（pc）（457KB）（477）

Save

The authors introduce the concept and steps of entity linking in detail, and the problems and research status of named entity linking based on deep learning in recent years, analyze the problems and corresponding solution models of entity linking and present related data sets and evaluation methods. The authors summarize the current status of entity linking in international evaluation conferences and analyze the future research directions.

Related Articles | Metrics | Comments（0）

Select

Unsupervised Syntactically Controllable Paraphrase Network for Adversarial Example Generation

YANG Erguang, LIU Mingtong, ZHANG Yujie, MENG Yao, HU Changjian, XU Jin’an, CHEN Yufeng

Acta Scientiarum Naturalium Universitatis Pekinensis 2021, 57 (1): 83-90. DOI: 10.13209/j.0479-8023.2020.079

Abstract （965）

HTML

PDF（pc）（570KB）（183）

Save

Prior work on adversarial example generation with syntactically controlled paraphrase networks requires large-scale paraphrase parallel corpora to train models. The performance of the model is seriously limited by the domain and scale of paraphrase parallel corpus. To solve this problem, this paper proposes an unsuprervised syntactically controlled paraphrase model to generate adversarial examples which only needs monolingual data. Specifically, variational autoencoder is used to learn model, which maps a sentence and a syntactic parse tree into semantic and syntactic variables, respectively. By learning to reconstruct the input sentence from syntactic and semantic variables, the model effectively learns to generate syntactic paraphrases without using any parallel data. Experiment results on unsupervised sentence paraphrasing and adversarial example generation demonstrate that the proposed model achieves new state-of-the-art results on unsupervised paraphrase generation and generate effective adversarial examples. These examples can be used to improve the robustness and generalization of NLP (natural language processing) model.

Related Articles | Metrics | Comments（0）

Select

Object Space Relation Mechanism Fused Image Caption Method

WAN Zhang, ZHANG Yujie, LIU Mingtong, XU Jin’an, CHEN Yufeng

Acta Scientiarum Naturalium Universitatis Pekinensis 2021, 57 (1): 75-82. DOI: 10.13209/j.0479-8023.2020.080

Abstract （682）

HTML

PDF（pc）（5984KB）（208）

Save

Focusing on the specific information of the positional relationship between objects in the image, a neural network image summary generation model integrating spatial relationship mechanism is proposed, in order to provide key information (object position or trajectory) for downstream tasks such as visual question answering and voice navigation. In order to enhance the learning ability of the positional relationship between objects of the image encoder, the geometric attention mechanism is introduced by improving the Transformer structure, and the positional relationship between objects is explicitly integrated into the appearance information of the objects. In order to assist in the completion of specific information-oriented extraction and summary generation tasks, a data production method for relative position relations is further proposed, and the image abstract data set Re-Position of the position relations between objects is produced based on the SpatialSense data set. The experimental results of comparative evaluation with five typical models show that the five indicators of the proposed model are better than those of other models on the public test set COCO, and all six indicators are better than those of other models on Re-Position data set.

Related Articles | Metrics | Comments（0）

Select

Research on the Construction and Application of Paraphrase Parallel Corpus

WANG Yasong, LIU Mingtong, ZHANG Yujie, XU Jin’an, CHEN Yufeng

Acta Scientiarum Naturalium Universitatis Pekinensis 2021, 57 (1): 68-74. DOI: 10.13209/j.0479-8023.2020.078

Abstract （945）

HTML

PDF（pc）（398KB）（134）

Save

Taking Chinese as the research object, the authors put forward the method to construct large-scale and high-quality paraphrase parallel corpora. The paraphrase data augmentation method include transfering English paraphrase corpus to Chinese, by using the method of translation engines, and manually annotating evaluation data set. Based on the constructed Chinese paraphrase data, the validity of the paraphrase data construction application method is verified in the paraphrase recognition task and natural language inference task. Firstly, the paraphrase recognition data is generated based on the constructed paraphrase corpus, and the attention-based neural network model of sentence matching is pre-trained to capture the paraphrase information. Then, the pre-trained model is applied to the natural language inference task to improve the performance. The experimental results on the open set show that the constructed paraphrase corpus can be effectively applied to the paraphrase recognition task, and the model can learn paraphrase knowledge. When applied to natural language inference task, paraphrase knowledge can effectively improve the accuracy of natural language inference models and verify the effectiveness of paraphrase knowledge for downstream semantic understanding tasks. Meanwhile, the proposed construction method for the paraphrase corpus is language-independent, which can provide more training data for other languages and fields, generate high-quality paraphrase data, and further improve the performance of other tasks.

Related Articles | Metrics | Comments（0）

Select

A Multi-Mechanism Fused Paraphrase Generation Model with Joint Auto-Encoding Learning

LIU Mingtong, ZHANG Yujie, ZHANG Shu, MENG Yao, XU Jin’an, CHEN Yufeng

Acta Scientiarum Naturalium Universitatis Pekinensis 2020, 56 (1): 53-60. DOI: 10.13209/j.0479-8023.2019.104

Abstract （964）

HTML

PDF（pc）（828KB）（173）

Save

Neural network encoder-decoder framework has become the popular method for paraphrase generation, but there are still two problems. On the one hand, there are such issues as inaccurate entity words, unknown words and word repetition in the generated paraphrase sentences. To solve the first problem, we proposed a multimechanism fused paraphrase generation model to improve the decoder. The copy mechanism was used to copy words form input sentence for improving the generation of entity and unknown words. The coverage mechanism was used to model historical attention information to avoid word repetition. On the other hand, the limited-scale parallel paraphrase corpus limits the learning ability of the encoder. We proposed to jointly learn auto-encoding task, which shares one encoder with paraphrase generation task. The joint auto-encoding task enhances the learning ability of the encoder. Experimental results on Quora paraphrase dataset show that the multi-mechanism fused paraphrase generation model with joint auto-encoding task can effectively improve the performance of paraphrase generation.

Related Articles | Metrics | Comments（0）

Select

A Neural Paraphrase Identification Model Based on Syntactic Structure

LIU Mingtong, ZHANG Yujie, XU Jin’an, CHEN Yufeng

Acta Scientiarum Naturalium Universitatis Pekinensis 2020, 56 (1): 45-52. DOI: 10.13209/j.0479-8023.2019.092

Abstract （1057）

HTML

PDF（pc）（1016KB）（127）

Save

Paraphrase identification involves natural language semantic understanding. Most previous methods regarded sentences as sequential structures, and used sequential neural network for semantic composition. These methods do not consider the influence of syntactic structure on semantic computation. In this paper, we proposed a neural paraphrase identification model based on syntactic structure, and designed a tree-based neural network model for semantic composition, which extended the semantic representation from word level to phrase level. Furthermore, this paper proposed a syntactic tree alignment mechanism based on phrase-level semantic representation, and extracted features by using cross-sentence attention mechanism. Finally, a self-attention mechanism was used to enhance semantic representation, which could effectively model context information based on syntactic structure. Experiments on Quora paraphrase dataset show that the performance of paraphrase identification has been improved to 89.3% accuracy. The results further prove that the proposed semantic composition method based on syntactic structure, phrase-level cross sentence attention and self-attention are effective in improving paraphrase identification.

Related Articles | Metrics | Comments（0）

Select

Distant Supervision for Relation Extraction with Gate Mechanism

LI Xingya, CHEN Yufeng, XU Jin’an, ZHANG Yujie

Acta Scientiarum Naturalium Universitatis Pekinensis 2020, 56 (1): 39-44. DOI: 10.13209/j.0479-8023.2019.101

Abstract （1143）

HTML

PDF（pc）（699KB）（164）

Save

A piecewise convolutional neural network with gating mechanism is proposed, which would automatically filter positive correlation features at word-level. Moreover, the idea of soft-label is introduced to the gating mechanism to weaken the impact of hard labels on noise filtering. Combined with sentence-level noise filtering, the overall performance of the model is improved. The experimental results on the public dataset show that the proposed model has a significant improvement compared to the sentence-level noise filtering methods.

Related Articles | Metrics | Comments（0）

Select

Categorization on Sponge Measurements in Lingnan Rural Area

JIANG Danning, WANG Zhifang, CHEN Yufeng, ZHANG Zijiao, YANG Shaodong

Acta Scientiarum Naturalium Universitatis Pekinensis 2018, 54 (4): 873-882. DOI: 10.13209/j.0479-8023.2018.023

Abstract （844）

HTML

PDF（pc）（7484KB）（167）

Save

Focusing on historic, cultural and old villages in Guangdong Province, the authors extract and synthesize features and information on sponge measures applied by local villages. Based on aerial map on Google Earth, the pattern of the settlement and water system are observed and categorized. Based on 30 m DEM picture of Guangdong Province in ArcGIS, landscape and the graded water system of the whole province and the elevations of all villages are extracted, followed by inundation analysis. The level of the rain intensity in different parts of the province is also considered. 5 features are coded and input into SPSS. By hierarchical cluster analysis, the villages are classified. The study generalizes the sponge measures of all classifications and concludes with 4 kinds of sponge measures in villages in Lingnan area: terrain utilization, slowed runoff through impoundment, strategically directed runoff, and forced drainage. Suggestions are proposed to promote the application of traditional ecophronesis in modern urban practices in Linghan area.

Related Articles | Metrics | Comments（0）

Select

Integrating of Grapheme-Based and Phoneme-Based Transliteration Unit Alignment Method

LIU Bojia, XU Jin’an, CHEN Yufeng, ZHANG Yujie

Acta Scientiarum Naturalium Universitatis Pekinensis 2016, 52 (1): 75-80. DOI: 10.13209/j.0479-8023.2016.001

Abstract （883）

HTML

PDF（pc）（362KB）（890）

Save

In order to solve the errors caused by only using the pheneme-based method or the grapheme-based method, applying the theory of statistics and rules, this paper proposes a new method for transliteration unit alignment which integrates the two main transliteration methods. Four experiments are designed to compare with the traditional methods. Experimental results show that proposed method outperforms other methods in terms of performance in machine transliteration.

Related Articles | Metrics | Comments（0）